basic knowledge
HFT: Half Fine-Tuning for Large Language Models
Hui, Tingfeng, Zhang, Zhenyu, Wang, Shuohuan, Xu, Weiran, Sun, Yu, Wu, Hua
Large language models (LLMs) with one or more fine-tuning phases have become a necessary step to unlock various capabilities, enabling LLMs to follow natural language instructions or align with human preferences. However, it carries the risk of catastrophic forgetting during sequential training, the parametric knowledge or the ability learned in previous stages may be overwhelmed by incoming training data. In this paper, we find that by regularly resetting partial parameters, LLMs can restore some of the original knowledge. Inspired by this, we introduce Half Fine-Tuning (HFT) for LLMs, as a substitute for full fine-tuning (FFT), to mitigate the forgetting issues, where half of the parameters are selected to learn new tasks while the other half are frozen to remain previous knowledge. We provide a feasibility analysis from the perspective of optimization and interpret the parameter selection operation as a regularization term. Without changing the model architecture, HFT could be seamlessly integrated into existing fine-tuning frameworks. Extensive experiments and analysis on supervised fine-tuning, direct preference optimization, and continual learning consistently demonstrate the effectiveness, robustness, and efficiency of HFT. Compared with FFT, HFT not only significantly alleviates the forgetting problem, but also achieves the best performance in a series of downstream benchmarks, with an approximately 30% reduction in training time.
- North America > United States (0.14)
- Europe > Romania > Sud - Muntenia Development Region > Giurgiu County > Giurgiu (0.04)
- Asia > China > Beijing > Beijing (0.04)
Multilingual Pretraining and Instruction Tuning Improve Cross-Lingual Knowledge Alignment, But Only Shallowly
Gao, Changjiang, Hu, Hongda, Hu, Peng, Chen, Jiajun, Li, Jixing, Huang, Shujian
Despite their strong ability to retrieve knowledge in English, current large language models show imbalance abilities in different languages. Two approaches are proposed to address this, i.e., multilingual pretraining and multilingual instruction tuning. However, whether and how do such methods contribute to the cross-lingual knowledge alignment inside the models is unknown. In this paper, we propose CLiKA, a systematic framework to assess the cross-lingual knowledge alignment of LLMs in the Performance, Consistency and Conductivity levels, and explored the effect of multilingual pretraining and instruction tuning on the degree of alignment. Results show that: while both multilingual pretraining and instruction tuning are beneficial for cross-lingual knowledge alignment, the training strategy needs to be carefully designed. Namely, continued pretraining improves the alignment of the target language at the cost of other languages, while mixed pretraining affect other languages less. Also, the overall cross-lingual knowledge alignment, especially in the conductivity level, is unsatisfactory for all tested LLMs, and neither multilingual pretraining nor instruction tuning can substantially improve the cross-lingual knowledge conductivity.
- North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
- North America > Canada > Ontario > Toronto (0.04)
- Asia > China > Jiangsu Province > Nanjing (0.04)
- (28 more...)
The Courses You need to Succeed in your Computer Vision Career
The current demand for pursuing a career in the field of AI and computer vision is at an all-time high. As with various other aspects of the digital realm, a comprehensive understanding of these areas can be attained through online resources. It is often presumed that the quality of online courses could be better than traditional methods, such as college-level programs, practical experience in the field, and offline studies. However, online learning has advanced beyond this misconception. Paid and free online courses can teach fundamental computer vision principles and specific elements of the discipline.
- Education > Educational Setting > Online (1.00)
- Education > Educational Technology > Educational Software > Computer Based Training (0.59)
Study: ChatGPT has potential to help cirrhosis, liver cancer patients
A new study by Cedars-Sinai investigators describes how ChatGPT, an artificial intelligence (AI) chatbot, may help improve health outcomes for patients with cirrhosis and liver cancer by providing easy-to-understand information about basic knowledge, lifestyle and treatments for these conditions. The findings, published in the peer-reviewed journal Clinical and Molecular Hepatology, highlights the AI system's potential to play a role in clinical practice. "Patients with cirrhosis and/or liver cancer and their caregivers often have unmet needs and insufficient knowledge about managing and preventing complications of their disease," said Brennan Spiegel, MD, MSHS, director of Health Services Research at Cedars-Sinai and co-corresponding author of the study. "We found ChatGPT--while it has limitations--can help empower patients and improve health literacy for different populations." Patients diagnosed with liver cancer and cirrhosis, an end-stage liver disease that is also a major risk factor for the most common form of liver cancer, often require extensive treatment that can be complex and challenging to manage.
- Health & Medicine > Therapeutic Area > Oncology > Liver Cancer (1.00)
- Health & Medicine > Therapeutic Area > Hepatology (1.00)
Data Engineer
Launched in 1998, this pioneering British-born brand has specialised in creating amazing experiences and unforgettable memories - from hotels, city breaks and holidays to theatre, entertainment and spa days. Experts in brightening up online travel, lastminute.com is among the worldwide leaders in the field, helping hundreds of thousands of customers every year find, and do, "whatever makes them pink". Every month, the Group reaches across all its websites and mobile apps (in 17 languages and 40 countries) 60 million unique users that search for and book their travel and leisure experiences. More than 1,200 people enjoy working with us and contribute to provide our audience with a comprehensive and inspiring offering of travel related products and services. At the heart of our culture is a commitment of inclusion across race, gender, age sexual orientation, religion, gender identity or expression and accessibility.
Genetic Algorithm: A to Z with Combinatorial Problems
This is one of the most applied courses on Genetic Algorithms (GA), which presents an integrated framework to solve real-world optimization problems in the most simple way. For the first time, we have presented a practical course in the domain of metaheuristics algorithms required for students, researchers and practitioners. Firstly, we will introduce the basic theory of GA, then implement the simplest version of GA, namely Binary GA, into Matlab, and then present the continuous version, real GA, of it. Therefore, the main focus will be on the Genetic Algorithm as the most well-regarded optimization algorithm in the literature. In the following sections, we will introduce some well-known operation research problems, including transportation problems, hub location problems (HLP), quadratic assignment problems and travelling salesman problems (TSP) and try to solve them via GA.
Python for Data Science - NumPy, Pandas & Scikit-Learn
Welcome to the Python for Data Science - NumPy, Pandas & Scikit-Learn course, where you can test your Python programming skills in data science, specifically in NumPy, Pandas and Scikit-Learn. This course is designed for people who have basic knowledge in Python, NumPy, Pandas and Scikit-Learn packages. It consists of 330 exercises with solutions. This is a great test for people who are learning the Python language and data science and are looking for new challenges. Exercises are also a good test before the interview. Many popular topics were covered in this course.
What all you need to become a data scientist?
There is no single starting point or path you can follow to become a data scientist. You can start from anywhere -- from a science, engineering, commerce graduate, Ph. D degree and continue your journey with coding any kind of problem you see around, to attending online courses, participating in a Kaggle competition or doing a data science project under a mentor. Even there is no single starting point or path still there is set of common skills and passions that you must possess. Mathematics & reasoning comes first and along that you should have a passion for coding/programming and problem solving.
Trending online courses in business, data science, tech, and engineering
In this popular beginner-level Specialization, you'll develop management, leadership, finance, and digital marketing skills that can translate to the successful operation of a business. Learn the basics of running a business and develop new strategies for improving an organization's growth and profitability by analyzing financial statements, creating forecasting and budgeting, and embracing digital marketing best practices.
- Asia > India > Uttarakhand > Roorkee (0.06)
- North America > United States > New York (0.05)
- North America > United States > California > Orange County > Irvine (0.05)
- Education > Educational Setting > Online (1.00)
- Education > Educational Technology > Educational Software > Computer Based Training (0.50)
Top 3 Free Resources to Learn Linear Algebra for Machine Learning - KDnuggets
Mathematics is the core of all machine learning algorithms. And while it isn't a prerequisite to have formal math education in order to become a data scientist, you need to understand the principles of the subject well enough to successfully build models that add value. In an article I wrote previously, I explained the three branches of mathematics that were essential to gain a deeper understanding of ML algorithms -- statistics, calculus, and linear algebra. This article will solely focus on linear algebra, as it forms the backbone of machine learning model implementation. Linear algebra concepts like vectorization allow for faster computation speeds, and are implemented in libraries like Pandas, Scipy, and Scikit-Learn.